Revisiting the codon adaptation index from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models.

Authors

  • Ronald Jansen
  • Harmen J Bussemaker
  • Mark Gerstein
Abstract

Highly expressed genes in many bacteria and small eukaryotes often have a strong compositional bias, in terms of codon usage. Two widely used numerical indices, the codon adaptation index (CAI) and the codon usage, use this bias to predict the expression level of genes. When these indices were first introduced, they were based on fairly simple assumptions about which genes are most highly expressed: the CAI was originally based on the codon composition of a set of only 24 highly expressed genes, and the codon usage on assumptions about which functional classes of genes are highly expressed in fast-growing bacteria. Given the recent advent of genome-wide expression data, we should be able to improve on these assumptions. Here, we measure, in yeast, the degree to which consideration of the current genome-wide expression data sets improves the performance of both numerical indices. Indeed, we find that by changing the parameterization of each model its correlation with actual expression levels can be somewhat improved, although both indices are fairly insensitive to the exact way they are parameterized. This insensitivity indicates a consistent codon bias amongst highly expressed genes. We also attempt direct linear regression of codon composition against genome-wide expression levels (and protein abundance data). This has some similarity with the CAI formalism and yields an alternative model for the prediction of expression levels based on the coding sequences of genes. More information is available at http://bioinfo.mbb.yale.edu/expression/codons.
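The abstract does not spell out how the CAI is computed. As a rough illustration only, the sketch below assumes the commonly cited formulation: each codon gets a weight equal to its count in a reference set of highly expressed genes divided by the count of the most frequent synonymous codon, and a gene's CAI is the geometric mean of the weights of its codons. The two-amino-acid codon table, the function names, and the reference sequence are hypothetical and chosen for brevity; they are not taken from the paper. Codons absent from the reference set are simply skipped here, which is a simplification.

```python
import math
from collections import Counter

# Toy synonymous-codon table (assumption: only two amino acids, for brevity).
SYNONYMS = {
    "K": ["AAA", "AAG"],
    "F": ["TTT", "TTC"],
}
CODON_TO_AA = {c: aa for aa, codons in SYNONYMS.items() for c in codons}


def split_codons(seq):
    """Split a coding sequence into consecutive triplets, dropping any remainder."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def relative_adaptiveness(reference_seqs):
    """Weight of each codon = its count / count of the most used synonymous codon."""
    counts = Counter()
    for seq in reference_seqs:
        counts.update(c for c in split_codons(seq) if c in CODON_TO_AA)
    weights = {}
    for codons in SYNONYMS.values():
        best = max(counts[c] for c in codons)
        for c in codons:
            # Codons never seen in the reference set get no weight (simplification).
            if best > 0 and counts[c] > 0:
                weights[c] = counts[c] / best
    return weights


def cai(seq, weights):
    """Geometric mean of the weights of all scorable codons in seq."""
    logs = [math.log(weights[c]) for c in split_codons(seq) if c in weights]
    if not logs:
        raise ValueError("sequence contains no scorable codons")
    return math.exp(sum(logs) / len(logs))


# Hypothetical reference set standing in for "highly expressed genes".
reference = ["AAGAAGAAGAAATTCTTCTTT"]
w = relative_adaptiveness(reference)
preferred_score = cai("AAGTTC", w)   # uses only the most frequent codons
rare_score = cai("AAATTT", w)        # uses only the less frequent codons
```

Reparameterizing the index, in the sense described in the abstract, would correspond to changing which genes go into the reference set, for example selecting them from genome-wide expression data rather than from a small hand-picked list.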

Similar resources

Revisiting the CAI from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models

Highly expressed genes in many bacteria and small eukaryotes often have a strong compositional bias, in terms of codon usage. Two widely used numerical indices, the codon adaptation index (CAI) and the codon usage, use this bias to predict the expression level of genes. Both indices are based on fairly simple assumptions about which genes are most highly expressed, which were known when they we...

Identification of Synonymous Codon Usage Bias in the Pseudorabies Virus UL31 Gene

Background: Little is known about the synonymous codon usage pattern of the pseudorabies virus (PRV) genome, especially that of the UL31 gene over the course of its evolution. Objectives: In the present study, the codon usage bias between the PRV UL31 sequence and the UL31-like sequences was identified. Materials and Methods: We used a comprehensive analysi...

Codon bias patterns in photosynthetic genes of halophytic grass Aeluropus littoralis

Codon bias refers to differences in the frequency of occurrence of synonymous codons in coding DNA. Patterns of codon usage and of optimal codon usage differ significantly among organisms. This difference results from the long-term action of natural selection and evolution. Genetic drift, mutation, and regulation of gene expression are the main causes of codon bias. In this stu...

P-128: Optimization of Human LH Gene Expression by Codon Usage Adaptation in CHO Cell Line

Background: Human luteinizing hormone (hLH) is a glycoprotein hormone composed of two non-covalently linked subunits, α and β. The α-subunit is similar in all glycoprotein hormones, whereas the β-subunit confers the hormonal specificity. This hormone has important roles in the growth and maturity of sexual organs and secondary sexual characteristics and st...

Codon 72 Polymorphism of p53 Gene and Hematologic Manifestations in Patients with Systemic Lupus Erythematosus

Background: Systemic lupus erythematosus is a systemic autoimmune disorder with unclear etiology. The importance of some genes in the development of systemic lupus erythematosus has been implicated. The gene polymorphism in codon 72 has attracted a lot of attention and its role in the occurrence or progression of many cancers and autoimmune diseases especially systemic lupus erythematosus has ...

Journal:
  • Nucleic Acids Research

Volume 31, Issue 8

Pages: -

Publication date: 2003